Multiscale keypoint hierarchy for Focus-of-Attention and object detection

Author

  • J Rodrigues
Abstract

Hypercolumns in area V1 contain frequency- and orientation-selective simple and complex cells for line (bar) and edge coding, plus end-stopped cells for keypoint (vertex) detection. A single-scale (single-frequency) mathematical model of single and double end-stopped cells on the basis of Gabor filter responses was developed by Heitger et al. (1992 Vision Research 32 963-981). We developed an improved model by stabilising keypoint detection over neighbouring microscales. Because of the many filter scales represented by simple and complex cells, it is likely that, apart from a multi-scale line/edge representation, the visual cortex also constructs a multi-scale keypoint representation over multiple frequency octaves. Simulations with many different objects showed that, at very coarse scales, keypoints are found near the centre (centroid) of the objects. At medium scales, keypoints are detected at important parts of objects, for example the "fingers" of plant leaves, whereas at the finest scales they are found at points of high curvature on the contour. In other words, the multi-scale keypoint representation offers a hierarchical structure in terms of object, sub-objects and contour. In addition, a retinotopic summation of all detected keypoints over all scales provides one map with peaks caused by keypoints that are stable over many scales, and this map can be used as a saliency map for Focus-of-Attention. Further experiments showed that, for example, face detection can be achieved by grouping keypoints at expected positions (eyes, nose, mouth), taking into account symmetries and distances, and by combining suitable scales.
Hence, position-, rotation- and scale-invariant face detection may be achieved by embedding the multi-scale keypoint representation, in addition to the line/edge representation, into feedforward and feedback streams to/from higher areas V2, V4 and IT ("what" or parvo system), whereas the saliency map for FoA interacts with short-term memory via areas PP and MT ("where" or magno system). [Supported by PRODEP III Medida 5-Acção 5.3]
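
The retinotopic summation described in the abstract can be illustrated with a minimal sketch. It assumes that keypoint detection has already produced one map per scale, all of the same size, with nonzero entries at detected keypoints; the end-stopped cell model itself is not reproduced here. The function names `saliency_from_keypoint_maps` and `focus_of_attention` and the toy 5x5 maps are hypothetical and chosen only for illustration.

```python
import numpy as np

def saliency_from_keypoint_maps(keypoint_maps):
    """Sum per-scale keypoint maps (all the same shape) into one map.

    Assumption: each map is nonzero where a keypoint was detected at
    that scale. Positions that are stable over many scales accumulate
    a larger sum and therefore form peaks.
    """
    stack = np.stack([np.asarray(m, dtype=float) for m in keypoint_maps])
    return stack.sum(axis=0)

def focus_of_attention(saliency):
    """Return the (row, col) position of the highest peak in the map."""
    return np.unravel_index(np.argmax(saliency), saliency.shape)

# Toy data (hypothetical): three scales on a 5x5 retinotopic grid.
fine = np.zeros((5, 5))
fine[1, 1] = 1
fine[3, 3] = 1
medium = np.zeros((5, 5))
medium[3, 3] = 1
coarse = np.zeros((5, 5))
coarse[2, 2] = 1
coarse[3, 3] = 1

saliency = saliency_from_keypoint_maps([fine, medium, coarse])
row, col = focus_of_attention(saliency)
```

In this toy case only position (3, 3) carries a keypoint at every scale, so it becomes the single highest peak. A fuller model would probably also tolerate small positional shifts of a keypoint between scales, which this sketch does not attempt.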

Similar resources

Multi-scale Cortical Keypoint Representation for Attention and Object Detection

Keypoints (junctions) provide important information for focus-of-attention (FoA) and object categorization/recognition. In this paper we analyze the multi-scale keypoint representation, obtained by applying a linear and quasi-continuous scaling to an optimized model of cortical end-stopped cells, in order to study its importance and possibilities for developing a visual, cortical architecture. ...

Log-Spiral Keypoint: A Robust Approach toward Image Patch Matching

Matching of keypoints across image patches forms the basis of computer vision applications, such as object detection, recognition, and tracking in real-world images. Most keypoint methods are mainly used to match high-resolution images, and they always utilize an image pyramid for multiscale keypoint detection. In this paper, we propose a novel keypoint method to improve the matching perfor...

A 3D Keypoint Detector based on Biologically Motivated Bottom-Up Saliency Map

We present a new method for the detection of 3D keypoints on point clouds and we perform benchmarking between each pair of 3D keypoint detector and 3D descriptor to evaluate their performance on object and category recognition. Our keypoint detector is inspired by the behavior and neural architecture of the primate visual system. The 3D keypoints are extracted based on a bottom-up 3D saliency m...

Multiscale Significance Run: Realizing the ‘Most Powerful’ Detection in Noisy Images

Detection is a fundamental problem in many applications. In many cases, knowing the presence of underlying objects is of significant importance. Multiscale methods have been demonstrated to be advantageous in solving this problem. Besides theoretical results that have been achieved, this paper discusses how the ‘most powerful’ detection can be realized, for a set of specifically organized under...

Multi-scale Keypoints in V1 and Face Detection

End-stopped cells in cortical area V1, which combine outputs of complex cells tuned to different orientations, serve to detect line and edge crossings (junctions) and points with a large curvature. In this paper we study the importance of the multi-scale keypoint representation, i.e. retinotopic keypoint maps which are tuned to different spatial frequencies (scale or Level-of-Detail). We show t...

Publication date: 2005